Learning discrete Bayesian models for autonomous agent navigation

Authors

  • Daniel Nikovski
  • Illah R. Nourbakhsh
Abstract

Partially observable Markov decision processes (POMDPs) are a convenient representation for reasoning and planning in mobile robot applications. We investigate two algorithms for learning POMDPs from series of observation/action pairs by comparing their performance in fourteen synthetic worlds in conjunction with four planning algorithms. Experimental results suggest that the traditional Baum-Welch algorithm is better at learning the structure of worlds specifically designed to impede the agent, while a best-first model merging algorithm originally due to Stolcke and Omohundro [?, SO] performs better in more benign worlds, including those that model typical real-world robot fetching tasks.

Similar resources

A 'Cognitive Driving Framework' for Collision Avoidance in Autonomous Vehicles

The Cognitive Driving Framework is a novel method for forecasting the future states of a multi-agent system that takes into consideration both the intentions of the agents as well as their beliefs about the environment. This is particularly useful for autonomous vehicles operating in an urban environment. The algorithm maintains a posterior probability distribution over agent intents and belief...

A Study of Autonomous Agent Navigation Algorithms in the ANIMAT Environment

This paper examines the capabilities of an autonomous agent navigation algorithm. The ANIMAT problem was proposed to study the learning capabilities of the ZCS classifier system. A ZCS classifier system is purely reactive. For such a system it is essential to be able to identify the global state of the environment from the input message. This shortens the applicatio...

Multiagent Planning with Bayesian Nonparametric Asymptotics

Autonomous multiagent systems are beginning to see use in complex, changing environments that cannot be completely specified a priori. In order to be adaptive to these environments and avoid the fragility associated with making too many a priori assumptions, autonomous systems must incorporate some form of learning. However, learning techniques themselves often require structural assumptions to...

Effective Mechatronic Models and Methods for Implementation an Autonomous Soccer Robot

  Omni directional mobile robots have been popularly employed in several applications especially in soccer player robots considered in Robocup competitions. However, Omni directional navigation system, Omni-vision system and solenoid kicking mechanism in such mobile robots have not ever been combined. This situation brings the idea of a robot with no head direction into existence, a comprehensi...

Nonparametric Bayesian Learning of Other Agents' Policies in Interactive POMDPs

We consider an autonomous agent facing a partially observable, stochastic, multiagent environment where the unknown policies of other agents are represented as finite state controllers (FSCs). We show how an agent can (i) learn the FSCs of the other agents, and (ii) exploit these models during interactions. To separate the issues of off-line versus on-line learning we consider here an off-line ...

Publication date: 1999